Learning and discrimination through STDP 
in a top-down modulated associative memory * 

Anthony Mouraud^'^ and Helene Paugam-Moisy^ 

1 - Institut des Sciences Cognitives - UMR CNRS 5015 - 69675 Bron - France 
2 - GRIMAAG - Universite Antilles-Guyane - 97157 Pointe-a-Pitre - France 

February 1, 2008 



Abstract 

This article underlines the learning and discrimination capabilities of 
a model of associative memory based on artificial networks of spiking neu- 
rons. Inspired from neuropsychology and neurobiology, the model imple- 
ments top-down modulations, as in neocortical layer V pyramidal neurons, 
with a learning rule based on synaptic plasticity (STDP), for performing 
a multimodal association learning task. A temporal correlation method 
of analysis proves the ability of the model to associate specific activity 
patterns to different samples of stimulation. Even in the absence of ini- 
tial learning and with continuously varying weights, the activity patterns 
become stable enough for discrimination. 



1 Introduction 

In the scope of pattern recognition and machine learning, artificial neural net- 
works have proved to be computationally efficient tools. However classical learn- 
ing methods and connectionist models have shown several limitations, e.g. fast 
adaptation to changing environment, or modelling temporal binding and syn- 
chronization phenomena for multimodal integration [1] [5] . 

Introduced in ^3] and extensively described later [11, spiking neurons use 
precise timing of spike emissions as relevant neural code, in opposition to the 
rate coding of the first two generations of neural networks [5] . Spike time cod- 
ing increases the speed of image processing f6j and the computational power of 
neural networks [7]. However learning in spiking neuron networks (SNNs) is not 
yet controlled for performing general purpose discrimination tasks efficiently. A 
first track is to exploit neuroscientist knowledge on synaptic plasticity mecha- 
nisms, like Spike-Timing-Dependent Plasticity (STDP) [HKH], that can be easily 
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implemented in SNNs as unsupervised dynamic hebbian learning rule for on-line 
adaptation of weights. 

A second idea is to incorporate attentional mechanisms in the model, starting 
from an architecture of bidirectional associative memory already proven efficient 
for pattern recognition [TD]. It is largely approved that top-down processing is 
involved in attention [TT], which is fundamental for efficient learning. From 
physiological studies [12], we derive that neocortical layer V pyramidal neurons, 
able to integrate bottom-up and top-down signals, are well suited for modelling 
top-down influences in the network. 

Only few work try to associate SNNs and STDP in pattern recognition tasks, 
and none of them study the influence of top-down modulations in such tasks. 
In this article we present a three layer multimodal associative memory coupling 
models of spiking pyramidal neurons and interneurons in a STDP driven learn- 
ing and discrimination task. We show that the model is able to dynamically 
associate specific patterns of activity to bimodal stimulations and we study the 
influence of top-down modulations on learning speed and discrimination. 

2 Multimodal bidirectional associative memory 

Neuron model and network architecture Neurobiological studies have 
shown that neocortical layer V pyramidal neurons have specific abilities to in- 
tegrate distal (layers I and II) top-down inputs and proximal (thalamic, layers 
V and VI) bottom-up inputs thanks to a dendritic and axonal action poten- 
tial (AP) initiation sites [131 [T^]. Hence we retain neocortical layer V pyramidal 
neurons as basic model of neuron (Figure [T^) for our implementation of bimodal 
bidirectional associative memory. 

Based on a classical BAM architecture (Bidirectional Associative Memory), 
adapted for multimodal association [TOj , and also inspired from the architecture 
in [14], we define a network (Figure [Bd) of three layers with 100 excitatory and 
inhibitory neurons each. Sensory stimulations are received on the two perceptive 
layers, one for each modality. The associative layer plays a role of data fusion 
and gives the output pattern. 

Neural dynamics and interconnections Excitatory neocortical layer V 
pyramidal neurons are coupled with inhibitory interneurons for dynamically 
modulate the amount of activity in the network (for safe of clarity, inhibitory 
interneurons are not represented on Figure IDd) . The neurons are modeled in 
spike time coding, on the basis of the Spike Response Model [4] . At each time 
t, the membrane potential Uj of a postsynaptic neuron j depends on the spike 
( f) 

emission times tl of every presynaptic neurons i, weighted by w.y, and on its 

own past emission times t^f \ whith i^^* denoting the firing of neuron j, 
according to the equation 

ujit) = E ^(^ - if) + EE - ^i^^) (1) 
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Figure 1: a) Neocortical layer V pyramidal neuron model, b) Architecture of the net- 
work. Pyramidal neurons (triangles) have two distinct dentritic sites : basal (bottom 
horizontal line) and distal (upper horizontal line), c) STDP excitatory and inhibitory 
windows. Weight variation functions fi^ for At < and Vl'^ for At > 0. 



where 77(t) is the hyperpolarisation kernel of neuron j and e(i) is the postsynaptic 
response kernel. For excitatory pyramidal neurons only, if top-down influences 
are activated, an extra term 'YliCaP^ ~ added to the righthand side 
of equation ([T]), where p(t) is the response kernel to calcium action potentials 
CaAPs emitted at 1^^°''' from the distal tufted dendrites emission site towards 
the somatic initiation site [12 (Figure . 

The emissions of CaAPs (spikes) are function of both the depolarization of 
distal dendritic site of neuron j via backpropagated sodium action potentials 
NaAPs and distal dendritic excitation above a threshold 8ca via pre-synaptic 
spikes [l^ . The coincidence of backpropagated NaAPs and 0ca crossing gives 
rise to CaAPs, thus facilitating a spike emission for neuron j. 

The two perceptive layers (Figure [TJ)) receive sensory inputs on the basal 
dendritic site of the excitatory pyramidal neurons. Each pyramidal neuron 
makes a synapse to the basal site of the associative layer neurons, which in 
turn are connected to distal dendritic sites of the perceptive neurons, without 
layer distinction. Hence, perceptive pyramidal neurons compute the integration 
of top-down (associative) and bottom- up (sensory) inputs. At each pyramidal 
neuron in each layer we associate one strongly connected interneuron which 
makes inhibitory synapses with every other pyramidal neurons of the same layer. 

Model of synaptic plasticity The weight Wij from a presynaptic neuron i to 
a postsynaptic neuron j is modified at each spike emission time t^re or t^ost ■ Let 
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At = tpost — tpre be the time difference between the last post and presynaptic 
spikes and Acuij the weight variation of synapse subject to the bounds 

and 1, or— 1 for inhibitory synapses. We chose the foUowing multiphcative rule 
for synaptic plasticity 



where fl and fi"*" represent the linear weight variation functions of Figure [TJ;. 

3 Bimodal association task 

Bimodal stimulation and output A set of bimodal stimulations is a set S 
of pairs s; = {sj,sf) where sf is the input vector of perceptive layer k. In the 
present article, the network has been tested on a set S — {si...sio} of 10 pairs of 
characters represented by 10 x 10 matrices of black or white pixels. Associations 
to be learned are ('A','l')i ('BV2') and so on. An input pattern si is presented 
repetitively 10 times to the network, during 1ms every 10ms. Each 10ms time 
slot, a black pixel sf (j) causes a spike emission of the pyramidal neuron j on 
perceptive layer k. No spikes are caused by white pixels. The output oi of the 
network is the spiking activity pattern of the associative layer recorded along 
the 100ms presentation of a bimodal stimulation sj. Output can be represented 
by a matrix of 100 (# associative neurons) xlOO (# elementary 1ms time slots) 
binary values: 1 for an output spike emission, otherwise. 



Figure 2: Stimulation protocol. Phase 1: Learning. Phase 2: Discrimination. 

Protocol The objective is to evaluate the ability of the model, first to repro- 
duce a learned activity fc; for each bimodal stimulation s/ , second to discriminate 
pairs of stimuli via different output activities o/ for Z e {1 . . . 10}. The experi- 
mental protocol consists in two successive phases of stimulation. Starting from 
random weights, in phase 1 the network adapts all the connections by STDP 
through 10 repetitive presentations of a sequence of all the input patterns, in a 
given order (Figure[2|). Phase 1 lasts 10s and is followed immediately by phase 2 
where each pattern si is again presented 10 times to the network (100ms for each 
presentation), but the order of the s; is random and varies from one sequence to 
the next. All along the protocol, the weights of the network still continue vary- 
ing through STDP learning. However phase 2 is mainly designed for testing the 
ability of the model to recall and discriminate all the patterns, independently 
of the sequence order imposed for learning stabilisation in phase 1. At each 
step of presentation of 5*, oi{p) is the output activity pattern of the network 
for stimulation s/ at step p {p — 1 to 10 in phase 1, p — 11 to 20 in phase 2). 
The learned activity ki = o;(10) is defined as the activity pattern for s; at the 
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end of phase 1. The same experimental protocol is applied in two conditions: A 
network without top-down modulations and a network with CaAPs top-down 
modulations. 

Analysis The matricial representations of the network outputs Oi are com- 
pared by the method of template correlation analysis, as defined in [15] . The 
method produces a template correlation coefficient, ranging between —1 and -1-1, 
which indicates the overlapping of two matrices, and then the similarity between 
two activity patterns. First we have studied, for all the stimulations s;, the cor- 
relation of the successive outputs Oi(p)i<p<io in phase 1 to the learned activity 
ki, in both conditions (Figure [3] left). Second we have computed the average 
correlations between the learned activity ki and all the outputs oi{p)ii<p<2Q in 
phase 2, either for the same pattern, k ~ I, or for all the other stimulations, 
k ^ I (Figure [3] right, "same" or "different"). 



Leaming Discrimination 




■ Without CaAPs 

steps 

Figure 3: Learning and discrimination curves. Correlations between outputs in two 
conditions: Network without top-down modulations and network with CaAPs influ- 
ences. 



4 Results and conclusion 

The experimental protocol has been applied to five networks with different initial 
weights. Figure [3] presents the mean values of correlation coefficients, computed 
on all the stimulations and all the networks, for learning (Student test) and 
discrimination (ANOVA), without and with top-down modulations. The values 
that are globally significant w.r.t the others are marked by a star. 
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Figure [3] (left) shows that the correlation coefficients quickly increase in the 
first steps of learning phase 1 and reach values close to 0.9. That proves a 
good stability of the network output for every stimulations learned by STDP, 
even in the absence of initial or supervised learning. The influence of top- 
down modulations is clearly positive: The outputs are better correlated with 
the learned activities, both in strength and time. 

In discrimination (Figure [3] right), the mean correlation coefficients measure 
the global ability of the model to discriminate patterns during the recall phase 2. 
Although the weights continue varying by STDP adaptation, learned activities 
are efficiently reproducible as specific outputs for the learned stimulations, even 
presented in random order. In both conditions, the correlation with learned ac- 
tivities is significantly higher for the same pattern (case same) than for all the 
other ones (case different). Top-down influences improve the correlation (close 
to 0.95) but reduces the difference between the same and different cases. 

The results show a stable activity obtained for each bimodal stimulation 
Si G S, reproducible during task and significantly different for each stimulation. 
We also show that top-down modulations can significantly increase the stability 
and reproducibility, with the consequence of reducing the differences between 
stimuli specific responses. Our observations are coherent with the behaviour 
of general learning systems: High learnability power (e.g. VC-dimension) is 
not always suitable for good generalisation. However, in the framework of our 
experiments, our model of top-down modulated associative memory has both the 
abilities of fast learning (4 or 5 steps of STDP are sufficient for convergence) and 
good discrimination (correlation with different learned activities is significantly 
lower than correlation with the stimulus specific response). 
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